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Abstract 



The notion of 'general covariance' is intimately related to the notion of 'back- 
ground independence'. Sometimes these notions are even identified. Such an 
identification was made long ago by James Anderson, who suggested to de- 
fine 'general covariance' as absence of what he calls 'absolute structures', a 
term here taken to define the even less concrete notion of 'background' . We 
discuss some of the well known difficulties that occur when one tries to give 
a precise definition of the notion of 'absolute structure'. As a result, there 
still seem to be fundamental difficulties in defining 'general covariance' or 
'background independence' so as to become a non-trivial selection principle 
for fundamental physical theories. 

In the second part of this contribution we make some historical remarks 
concerning the 1913 'Entwurf -Theory by Einstein and Grossmann, in which 
general covariance was first put to the fore, and in which Einstein presented 
an argument why Poincare-invariant theories for a zero-mass scalar gravita- 
tional field necessarily suffer from severe inconsistencies concerning energy 
conservation. This argument is instructive, even though — or because — it ap- 
pears to be incorrect, as we will argue below. 

This paper is a contribution to "An assessment of current paradigms in 
the physics of fundamental interactions", edited by I.O. Stamatescu (Springer 
Verlag, to appear). 
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1 Introduction 



It is a widely shared opinion that the most outstanding and characteristic feature 
of General Relativity is its manifest background independence. Accordingly, those 
pursuing the canonical quantization programme for General Relativity see the fun- 
damental virtue of their approach in precisely this preservation of 'background 
independence'. Indeed, there is no disagreement as to the background dependence 
of competing approaches, like the perturbative spacetime approach^ or string the- 
ory. Accordingly, many string theorists would subscribe to the following research 
strategy: 

"Seek to make progress by identifying the background structure in 
our theories and removing it, replacing it with relations which evolve 
subject to dynamical laws." ( 1221 p. 10). 

But what means do we have to reliably identify background structures? 

There is another widely shared opinion according to which the principle of 
general covariance is devoid of any physical content. This was first forcefully 
argued for in 1917 by Erich Kretschmann |14| and almost immediately accepted 
by Einstein |24j (Vol. 7, Doc. 38, p. 39), who from then on seemed to have granted 
the principle of general covariance no more physical meaning then that of a formal 
heuristic concept. 

From this it appears that it would not be a good idea to define 'background 
independence' via 'general covariance', for this would not result in a physically 
meaningful selection principle that could effectively guide future research. What 
would be a better definition? 'Diffeomorphism invariance' is the most often quoted 
candidate. What precisely is the difference between general covariance and diffeo- 
morphism invariance, and does the latter really improve on the situation? These 
are the questions to be discussed here. For related and partially complementary 
discussions, that also give more historical details, we refer to fT8l[T9l and yj re- 
spectively. 

As a historical remark we recall that Einstein quite clearly distinguished be- 
tween the principle of general relativity (PGR) on one hand, and the principle of 
general covariance (PGC) on the other. He proposed that the formal PGC would 
imply (but not be equivalent to) the physical PGR. He therefore adopted the PGC 
as a heuristic principle, guiding our search for physically relevant equations. But 
how can this ever work if Kretschmann is right and hence PGC devoid of any phys- 
ical content? Well, what Kretschmann precisely said was that any physical law can 
be rewritten in an equivalent but generally covariant form. Hence general covari- 
ance alone cannot rule out any physical law. Einstein maintained that it did if one 

' Usually referred to as the 'covariant approach', since perturbative expansions are made around a 
maximally symmetric spacetime, like Minkowski or DeSitter spacetime, and the theory is intended 
to manifestly keep covariance under this symmetry group (i.e. the Poincare or the DeSitter group), 
not the diffeomorphism group! 
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considers the aspect of 'formal simplicity'. Only those expressions which are for- 
mally 'simple' after having been written in a generally covariant form should be 
considered as candidates for physical laws. Einstein clearly felt the lack for any 
good definition of formal 'simplicity', hence he recommended to experience it by 
comparing General Relativity to a generally covariant formulation of Newtonian 
gravity (then not explicitly known to him), which was later given by Cartan HQ 
and Friedrichs fTTl and which did not turn out to be outrageously complicated, 
though perhaps somewhat unnatural. In any case, one undeniably feels that this 
state of affairs is not optimal. 

2 Attempts to define general covariance and/or 
background independence 

A serious attempt to clarify the situation was made by James Anderson f^lfT], 
who introduced the notion of absolute structure which here we propose to take 
synonymously with background independence. This attempt will be discussed in 
some detail below. Before doing this we need to clarify some other notions. 

2.1 Laws of motion: covariance versus invariance 

We represent space-time by a tuple (M,g), where M is a four-dimensional in- 
finitely differentiable manifold and g aLorentzian metric of signature (+,—,—,—). 
The global topology of M is not restricted a priori, but for definiteness we shall 
assume a product-topology M x S and think of the first factor as time and the sec- 
ond as space (meaning that g restricted to the tangent spaces of the submanifolds 
St := {t} X S is negative definite and positive definite along Mp := M x {p}. Also, 
unless stated otherwise, the Lorentzian metric g is assumed to be at least twice 
continuously differentiable. We will generally not need to assume (M, g) to be 
geodesically complete. 

Being a C°° -manifold, M is endowed with a maximal atlas of coordinate func- 
tions on open domains in M with C°° -transition functions on their mutual over- 
laps. Transition functions relabel the points that constitute M, which for the time 
being we think of as recognizable entities, as mathematicians do. (For physicists 
these points are mere 'potential events' and do not have an obvious individual- 
ity beyond an actual, yet unknown, event that realizes this potentiality.) Different 
from maps between coordinate charts are global diffeomorphisms on M, which 
are C°° maps f : M — ) M with C°° inverses f : M — > M. Diffeomorphisms 
form a group (multiplication being composition) which we denote by Diff(M). 
Diffeomorphisms act (mostly, but not always, naturally) on geometric objects rep- 
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resenting physical entities, like particles and fields.^ The transformed geometric 
object has then to be considered a priori as a different object on the same manifold 
(which is not meant to imply that they are necessarily physically distinguishable in 
a specific theoretical context). This is sometimes called the 'active' interpretation 
of diffeomorphisms to which we will stick throughout. 

Structures that obey equations of motion are e.g. particles and fields. Classi- 
cally, a structureless particle (no spin etc.) is mathematically represented by a map 
into spacetime: 

y:M^M, (1) 

such that the tangent vector-field y is everywhere timelike, i.e. (g(y,y) > 0). 
Other structures that are also represented by maps into spacetime are strings, mem- 
branes, etc. 

Afield is defined by a map from spacetime, that is, 

O : M ^ V (2) 

where V is some vector space (or, slightly more general, affine space, to include 
connections). To keep the main argument simple we neglect more general situ- 
ations where fields are sections in non-trivial vector bundles or non-linear target 
spaces. 

Let y collectively represent all structures given by maps into spacetime and 
O collectively all structures represented by maps from spacetime. Equations of 
motions usually take the general symbolic form 

.F[y,O,I]=0 (3) 

which should be read as equation for y , O given L. 

L represents some non-dynamical structures on M. Only if the value of L is 
prescribed do we have definite equations of motions for (y,0). This is usually 
how equations of motions are presented in physics: solve ^ for (y, O), given L. 
Here only (y, O) represent physical 'degrees of freedom' of the theory to which 
alone observables refer (or out of which observables are to be constructed). By 
'theory' we shall always understand, amongst other things, a definite specification 
of degrees of freedom and observables. 

The group Diff(M) acts on the objects (y,0) (here we restrict the fields to 
tensor fields for simplicity) as follows: 

(f ! y ) — > f • y := g o y for particles etc. , (4a) 

(f , O ) ^ f • O := D (f J o O o f-^ for fields etc. , (4b) 

^ For example, diffeomorphisms of M. lift naturally to any bundle associated to the bundle of linear 
frames and hence act naturally on spaces of sections in those bundles. In particular these include 
bundles of tensors of arbitrary ranks and density weights. On the other hand, there is no natural 
lift to e.g. spinor bundles, which are associated to the bundle of orthonormal frames (which are 
only naturally acted upon by isometrics, but not by arbitrary diffeomorphisms). 
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where D is the representation of GL(4,M) carried by the fields. In addition, we 
require that the non-dynamical quantities L to be geometric objects, i.e. to support 
an action of the diffeomorphism group. 

Definition 1. Equation Q is said to be covariant under the subgroup G C Diff (M) 
iff for all f e G 

F[y,0,I] =0 ^ F[f -y, f -a), f -I] =0. (5) 

Definition 2. Equation ^ is said to be invariant under the subgroup G C Diff (M) 
iff for all f G G 

F[t,0,I] =0 ^ F[f -y, f -O, I] =0. (6) 

Note the difference: in Definition|2l the non-dynamical structures L are the 
same on both sides of the equation, whereas in Definition^they are allowed to be 
also transformed by f G Diff(M). Covariance merely requires the equation to 'live 
on the manifold', i.e. to be well defined in a differential-geometric sense, whereas 
an invariance is required to transforms solutions to the equations of motions to 
solutions of the very same equation^ , which is a much more restrictive condition. 

As a simple example, consider the vacuum Maxwell equations on a fixed space- 
time (Lorentzian manifold (M, g)): 

dF = , (7a) 
d*F = 0, (7b) 

where F denotes the 2-form of the electromagnetic field and d the exterior differ- 
ential. The * denotes the (linear) 'Hodge duality' map, which in components reads 

*F|j,^ = ^^n-vccpF'^'^ , (8) 

and which depends on the background metric g through e and the operation of 
raising indices: F'^P := g°'^g'^^ F^^. The system Q is clearly Diff(M) -covariant 
since it is written purely in terms of geometric structures on M and makes perfect 
sense as equation on M. In particular, given any diffeomorphisms f of M, we have 
that f • F satisfies dTal iff F does. But it is not likewise true that d * F = implies 
d * f • F = 0. In fact, it may be shown"^ that this is true iff f is a conformal isometry 
of the background metric g, i.e. f • g = A g for some positive real- valued function A 
on M. Hence the system © is not Diff(M) -invariant but only G-invariant, where 
G is the conformal group of (M, g). 

^ In the mathematical hterature this is called a symmetry (of the equation). We wish to avoid the 
term 'symmetry' here altogether because that - in our terminology ~ is reserved for a further dis- 
tinction of invariances into symmetries, which change the physical state, and redundancies (gauge 
transformations) which do not change the physical state. Here we will not need this distinction. 

* This is true in 1+3 dimensions. In other dimensions higher than two f must even be an isometry 
of g. 



6 



2.2 Triviality pursuit 



2.2.1 Covariance trivialised (Kretschmann's point) 

Consider the ordinary 'non-relativistic' diffusion equation for the M-valued field 4> 
(giving the concentration density): 

9tct) = kAcI) . (9) 

This does not look Lorentz covariant, let alone covariant under diffeomorphisms. 
But if rewritten it in the form 

{n^V^-K(n^n^-gnV^V^}ct) = 0, (10) 

where g are the contravariant components of the spacetime metric (recall that we 
use the 'mostly minus' convention for its signature), is its covariant derivative, 
and is a normalized covariant-constant timelike vector field which gives the 
preferred flow of time encoded in ^ (i.e. on scalar fields 9t = n*^V|^). Equation 
([Tot has the form ^ with no y, O = c(), and L = (g^^, n^) and is certainly dif- 
feomorphism covariant in the sense of Definition^ The largest invariance group - 
in the sense of Definition|2l- is given by that subgroup of Diff (M) whose elements 
stabilize the non-dynamical structures L. We write 

StabDiff(M)(^) = {f G Diff(M) I f • I = 1} (11) 

In our case, Stab]5iff(M](g) the 10-parameter Poincare group. In addition, f sta- 
bilizes if it is in the 7-parameter subgroup M x E(3) of time translations and 
spatial Euclidean motions. 

This example already shows (there will be more below) how to proceed in 
order to make any theory covariant under Diff(M). As already noted, Diff(M)- 
covariance merely requires the equation to be well defined in the sense of differen- 
tial geometry, i.e. it should live on the manifold. It seems clear that any equation 
that has been written down in a special coordinate system on M (like Q) can also 
be written in a Diff(M)-covariant way by introducing the coordinate system - or 
parts of it - as background geometric structure. This is, in more modem terms, the 
formal core of the critique put forward by Erich Kretschmann in 1917 fl4l . 

2.2.2 Invariance trivialized 

Given that an equation of the form ^ is already G -covariant, we can equivalently 
express the condition of being G -invariant by 

F[y,(l),I] =0 ^ F[y,a), f -I] =0, Vf G G , (12) 

i.e. any solution of the equation parameterized by L is also a solution of the differ- 
ent equation parameterized by f • Z. Evidently, the more non-dynamical structures 
there are the more difficult it is to satisfy (I12t . In generic situations it will only be 
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satisfied if G = StabDiff(]y[)(Z). Hence, in distinction to the covariance group, in- 
creasing the amount of structures of the type L cannot enlarge the invariance group. 
The case of the largest possible invariance group deserves a special name: 

Definition 3. Equation ^ is called diffeomorpliism invariant iff it allows Diff(M) 
as invariance group. 

In view of (I12t . the requirement of Diff(M) -invariance can be understood as 
a strong limit on the amount of non-dynamical structure L. Generically it seems 
to eUminate any Z, i.e. the theory should contain no non-dynamical background 
fields whatsoever. Intuitively this is what background independence stands for. 
Conversely, any Diff(M)-covariant theory without non-dynamical fields is trivially 
Diff (M)-invariant. Hence it seems sensible to simply identify 'Diff(M) -invariance' 
and 'background independence', and this is what most working physicists seem to 
do. 

But this turns out to be too simple. The heart of the difficulty lies in our distinc- 
tion between dynamical and non-dynamical structures, which turns out not to be 
sufficiently sharp. Basically we just said that a structure (y or O) was dynamical 
if it had no a priori prescribed values, but rather obeyed some equations of motion. 
We did not say what qualifies an equation as an 'equation of motion'. Can it just 
be any equation? If yes then we immediately object that there exists an obvious 
strategy to trivialize the requirement of Diff(M)-invariance: just let the values of 
L be determined by equations rather than by hand; in this way they formally be- 
come 'dynamical' variables and no non-dynamical quantities are left. Formally 
this corresponds to the replacement scheme 

^ O' = (0,I), (13a) 

1 ^ I' =0, (13b) 

so that invariance now becomes as trivial as the requirement of covariance. 

More concretely, reconsider the examples Q and (fTOb above. In the first case 
we now regard the spacetime metric g as 'dynamical' field for which we add the 
condition of flatness as 'equation of motion': 

Riem[g] = , (14) 

where Riem denotes the Riemann tensor of (M, g). In the second case we regard 
g as well as the timelike vector field n as 'dynamical' and add (fT4li and the two 
equations 

g(n,n) = , (15a) 
Vn =0. (15b) 

In this fashion we arrive at diffeomorphism invariant equations. But do they really 
represent the same theory as the one we originally started from? For example, are 
their solution spaces 'the same'? Naively the answer is clearly 'no', simply because 
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the reformulated theory has — by construction — a much larger space of solutions. 
For any solution O of the original equations F[0, L] = 0, where L is fixed, we now 
have the whole Diff(M)-orbit of solutions, {(f • O, f • I) | f G Diff(M)} of the 
new equations, which treat L as dynamical variable. A bijective correspondence 
can only be established if the transformations f that act non-trivially on 1 (i.e. 
f ^ StaboiffjM] (I)) are declared to be gauge transformations, so that any two field 
configurations related by such a f are considered to be physically identical. 

If this is done, the simple strategy outlined here suffices to (formally) trivialize 
the requirement of diffeomorphism invariance. Hence defining background inde- 
pendence as being simple diffeomorphism invariance would also render it a trivial 
requirement. How could we improve its definition so as to make it a useful no- 
tion? This is precisely what Anderson attempted in |3|. He noted the following 
peculiarities of the reformulation just given: 

1. The new fields g or (g, n) obey an autonomous set of equations which does 
not involve the proper dynamical fields F or c|) respectively. In contrast, the 
equations for the latter do involve g or (g,rL). Physically speaking, the sys- 
tem whose states are parameterized by the new variables acts upon the sys- 
tem whose states are parameterized by F or cj), but not vice versa. An agent 
which dynamically acts but is not acted upon may well be called 'absolute' - 
in generalization of Newton's absolute space. Such an absolute agent should 
be eliminated. 

2. The sector of solution space parameterized by g or ( g , n) consists of a single 
diffeomorphism orbit. For example, this means that for any two solutions 
(4),g,n) and [^',g',n') of (flOt . (fT4l . and (fTSb there exists a diffeomor- 
phism f such that (g', n') = (f • g , f • n). So 'up to diffeomorphisms' there 
exists only one solution in the (g,n)— sector. This is far from true for c|): the 
two solutions c|3 and cf) ' are generally not related by a diffeomorphism. This 
difference just highlights the fact that the added variables really did not cor- 
respond to new degrees of freedom (they were never supposed to) because 
the added equations were chosen strong enough to maximally fix their values 
(up to diffeomorphisms). 

A closer analysis shows that the first criterion is really too much dependent on 
the presentation to be generally useful as a necessary condition. Absolute struc- 
tures will not always reveal their nature by obeying autonomous equations. The 
second criterion is more promising and actually entered the literature with some 
refinements as criterion for absolute structures. Before going into this, we will 
discuss some attempts to disable the trivialization strategies just outlined. 
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2.3 Strategies against triviality 

2.3.1 Involving the principle of equivalence 

As diffeomorphism covariance is a rather trivial requirement to satisfy, we will 
from now on only be concerned with diffeomorphism invariance. As we explained, 
it could be achieved by letting the Z's 'change sides', i.e. become dynamical struc- 
tures (y's and O's), as schematically written down in (fT3t . We seek sensible crite- 
ria that will limit the number of such renegades. A physical criterion that suggests 
itself is to allow only those Z to change sides which are known to correspond to 
dynamical variables in a wider context. For example, we may allow the spacetime 
metric g to become formally dynamical, since we know that it describes the grav- 
itational field, even if in the context at hand the self-dynamics of the gravitational 
field is not relevant and therefore, as a matter of approximation, fixed to some value 
(e.g. the Minkowski metric). Doing this would render the Maxwell equations © 
(plus the equations for g) diffeomorphism invariant. But this alone would not work 
for the diffusion equation, where n would still act as a non-dynamical structure. 

Hence we see that the requirement to achieve diffeomorphism invariance by at 
most adjoining g to the dynamical variables is rather non trivial and connects to 
Einstein's principle of equivalence. Let us quote Wolfgang Pauli in this context 
(II2TI. p. 181, his emphasis): 

"Einen physikalischen Inhalt bekommt die allgemeine kovariante For- 
mulierung der Naturgesetze erst durch das Aquivalenzprinzip, welches 
zur Folge hat, daB die Gravitation durch die g^^i^ allein beschrieben 
wird, und das diese nicht unabhangig von der Materie gegeben, son- 
dern selbst durch die Feldgleichungen bestimmt sind. Erst deshalb 
konnen die gti^ als physikalische Zustandsgrofien bezeichnet werden".^ 
( f21J . p. 181; the emphases are Pauli 's) 

2.3.2 Absolute structures 

As already remarked, another strategy to render the requirement of diffeomorphism 
invariance non-trivial was suggested by Anderson iT] by means of his notion of 
'absolute structures'. However, most commentators share the opinion that Ander- 
son did not succeed to give a proper definition of this term. Even worse, some feel 
that so far nobody has, in fact, succeeded in giving a fully satisfying definition. 

To see what is behind this somewhat unhappy state of affairs let us start with a 
tentative definition that suggests itself from the discussion given above: 

Definition 4 (Tentative). Any field which is either not dynamical, or whose solu- 
tion space consists of a single Diff(M)-orbit, is called an absolute structure. 

^ "The generally covariant formulation of the physical laws acquires a physical content only through 
the principle of equivalence, in consequence of which gravitation is described solely by the gik 
and these latter are not given independently from matter, but are themselves determined by field 
equations. Only for this reason can the gtk be described as physical quantities'^ ( HQi . p. 150). 
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In general terms, let S denote the space of solutions to a given theory. If the the- 
ory is Diff(M) invariant S carries an action of Diff(M]. The fields can be thought 
of as coordinate functions on S. An absolute structure is a coordinate which takes 
the same range of values in each Diff(M) orbit and therefore cannot separate any 
two of them. If we regard Diff(M) as a gauge group, i.e. that Diff(M) -related 
configurations are physically indistinguishable, then absolute structures carry no 
observable content. 

Following our general strategy we could now attempt to give a definition of 
'background independence': 

Definition 5. (Tentative) A theory is called background independent iff its equa- 
tions are Diff (M)-invariant in the sense of Definition|3]and its fields do not include 
absolute structures in the sense of Definition|4] 

Before discussing these proposal, let us look at some more examples. 

2.4 More examples 

2.4.1 Scalar gravity a la Einstein-Fokker 

In 1913, just before the advent of General Relativity, Gunnar Nordsrom invented a 
formally consistent Poincare-invariant scalar theory of gravity, a variant of which 
we will describe in some detail in the second part of this contribution.^ Its essence 
is the field equation ( l29l and the equation of motion (13 5 at for a test particle. Shortly 
after its publication it was pointed out by Einstein and Fokker that Nordstrom's 
(second) theory can be presented in a 'covariant' way. Explicitly they said: 

"Im folgenden soil dargetan werden, daB man zu einer in formaler 
Hinsicht vollkommen geschlossenen und befriedigenden Darstellung 
der Theorie [Nordstroms] gelangen kann, wenn man, wie dies bei der 
Einstein-Grossmannschen Theorie bereits geschehen ist, das invarianten- 
theoretische Hilfsmittel benutzt, welches uns in dem absoluten Differ- 
entialkalkiil gegeben ist".^ (|24|, Vol. 4, Doc. 28, p. 321) 

The essential observation is this: consider conformally flat metrics: 

gixv = 45^11 (16) 
then the field equation is equivalent to 

R[g] = 247tGg^n^^, (17a) 

* In fact, there are two related but inequivalent scalar theories by Nordsrom; see e.g. H6il . The one 
presented in part 2 is essentially equivalent to a theory sketched by Otto Bergmann in 1956 O, 
which Harvey 1 12 1 classified as a modification of Nordstroms first theory. 

' "In the following we wish to show that one can arrive at a formally complete and satisfying pre- 
sentation of the theory [Nordstrom's] if one uses the methods from the theory of invariants given 
by the absolute differential calculus, as it was already done in the Einstein-Grossman theory". 
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where R[g] is the Ricci scalar for the metric g, whereas the equation of motion for 
the particle becomes the geodesic equation with respect to g: 

+ r^pX«xP = . (17b) 

Now, the system (fT/l . considered as equations for the metric g and the trajectory 
X, is clearly Diff(M)-invariant. But Nordstroms theory is equivalent to (fTTb plus 
(fT6l . Here rj is a non-dynamical field so that (I17ll6t is only Diff(M)-covariant. 
According to the general scheme outUned above this could be remedied by letting 
the metric rj be a new dynamical variable whose equation of motion just asserts its 
flatness: 

Riem[ri]=0. (18) 

But then rj qualifies as an absolute structure according to Definition|4]and the the- 
ory (I17ll6ll8t is not background independent. The subgroup G C Diff(M) that 
stabilizes rj is — by definition — the inhomogeneous Lorentz group, which had al- 
ready been the invariance group of Nordstroms theory. So no additional invariance 
has, in fact, been gained in the transition from Nordstrom's to the Einstein-Fokker 
formulation. 

Sometimes the absolute structures are not so easy to find because the theory is 
formulated in such a way that they are not yet isolated as separate field. For exam- 
ple, in the case at hand, (fT6b and dlSt together are clearly equivalent to the single 
condition that g be conformally fiat, which in turn is equivalent to the vanishing of 
the conformal curvature tensor for g (Weyl tensor): 

Weyl[g]=0. (19) 

The field rj ^-^ has now disappeared from the description and the theory does not 
explicitly display any absolute structure anymore. But, of course, it is still there; 
it is now part of the field g. To bring it back to light, make a field redefinition 
Qyx-v ^ (4^, h.^-^,) which isolates the part determined by (fT9l : for example 

4) := [-det{g^^}]^ , (20) 
h-ixv := g^A' [-det{g^^}] t. (21) 

Then any two solutions for the full set of equations are such that their component 
fields h,|j^ and h,^^ are related by a diffeomorphism. Hence h^^ is an absolute 
structure. 

Clearly there is a rather non-trivial mathematical theory behind the last state- 
ment of diffeomorphism equivalence of h,|j^. We could not have made that state- 
ment had we not already been in possession of the full solution theory for (fT9b 
which, after all, is a complicated set of non-linear partial differential equations of 
second order. 
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2.4.2 A massless scalar field from an action principle 

Usually we require the equations of motion to be the Euler-Lagrange equations for 
some associated action principle. Would the somewhat bold strategy to render non- 
dynamical structures dynamical by adding by hand 'equations of motion' which fix 
them to their previous values also work if these added equations were required to 
be the Euler-Lagrange equations for some common action principle? The answer is 
by no means obvious, as the following simple example taken from L23J illustrates: 
Consider a real massless^ scalar field in Minkowski space: 

n<^:=rf^V^V^i^ = 0. (22) 

According to standard strategy the non-dynamical Minkowski metric rj is elimi- 
nated by introducing the dynamical variable g, replacing rj in d22b by g, and adding 
the flatness condition 

Riem[g] = (23) 

as new equation of motion. Is there an action principle whose Euler-Lagrange 
equations are (equivalent to) these equations? This seems impossible without in- 
troducing yet another field A (a Lagrange multiplier) whose variation just yields 
d23l . The action would then be 

S - ^ 

where the symmetries of the tensor field A are that of the Riemann tensor: 

Variation with respect to (() and A yield (122 1 and d23l respectively, and variation 
with respect to g gives 

V^V^A'^^P^ = T'^P , (26) 

where T"P is the energy-momentum tensor for cf). These equations do not give 
a background independent theory for the fields ((t),g,A) since g is an absolute 
structure. The solution manifold of the ^ field is, in fact, the same as before. For 
this it is important to note that there is an integrability condition resulting from 
(I26l23t . namely V aJ°^^ = 0> which is however already implied by d22b . Hence no 
extra constraints on c|) result from d26l . 

However, the A field seems to actually add more dimensions to the solution 
manifold and hence to the observable content of the theory. Indeed, using the 
Poincare Lemma in fiat space one shows that any divergenceless symmetric 2- 
tensor T*^"^ can always be written as in d26b . where A has the symmetries d25l . But 
this does not fix A'^*^'^, so that the set of Diff(M)-equivalence classes of stationary 
points of d24b is strictly 'larger' than the set of solutions of d22b . In other words, 
the (Diff(M) reduced) phase space for the theory described by (l24li is 'larger' then 
that for (l22l .^ A a result we conclude that the reformulation given here does not 

* This is just assumed for simplicity. The arguments works the same way if a mass term were 
included. 

' I am not aware of a reference where a Hamiltonian reduction of <24t is carried out. 



dVA*P^*^^R 



(24) 
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achieve an equivalent Diff (M)-invariant reformulation of d22l in terms of an action 
principle. 

2.5 Problems with absolute structures 

A first thing to reaUze form the examples above is that the notion of absolute struc- 
ture should be slightly refined. More precisely, it should be made local in order 
to capture the idea that an absolute element in the theory does not represent lo- 
cal degrees of freedom. Rather than saying that a field corresponds to an absolute 
structure if its solution space consists of a single Diff(M) -orbit, we would like to 
make the latter condition local: 

Definition 6. Two fields T-\ and T2 are said to be locally diffeomorphism equiva- 
lent iff for any point p G M there exits a neighbourhoods U of p and a diffeomor- 
phism (^u : U — > U such that 4)u • (Ti |^) = Ti]^. 

Note that local diffeomorphism equivalence defines an equivalence relation on 
the set of fields. Accordingly, following a suggestion of Friedman L9J, we should 
replace the tentative Definition|4]by the following 

Definition 7. Any field which is either not dynamical or whose solutions are all 
locally diffeomorphism equivalent is called an absolute structure. 

In fact, this is what we implicitly used in the discussions above where we 
slightly oversimplified matters. For example, any two fiat metrics gi , 92 (i-C- which 
satisfy Riem[gi 2] = 0) are generally only locally diffeomorphism equivalent. 
Likewise, a conformally fiat metric g (i.e. which satisfy Weyl[g]=0) is locally 
diffeomorphism equivalent to f^, where f is non-vanishing function and rj is a 
fixed fiat metric. 

Having corrected this we should also adapt the tentative Definition|5j 

Definition 8. A theory is called background independent iff its equations are 
Diff(M)-invariant in the sense of Definition|3land its fields do not include absolute 
structures in the sense of Definition0 

So far so good. Is this, then, the final answer? Unfortunately not! The standard 
argument against this notion of absolute structure is that it may render structures 
absolute that one would normally call dynamical. The canonical example, usually 
attributed to Robert Geroch (T3\ . makes use of the well known fact in differential 
geometry that nowhere vanishing vector fields are always locally diffeomorphism 
equivalent (see e.g. Theorem 2.1.9 in |1|). Hence any diffeomorphism invariant 
theory containing vector fields among their fundamental field variables cannot be 
background independent. For example, consider the coupled Einstein-Euler equa- 
tions for a perfect fluid of density p and four-velocity u in spacetime with metric 
g. This system of equations is Diff(M) -invariant. By definition of a velocity field 
we have g(u, u) = c^. This means that u cannot have zeros, even if for physical 
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reasons we would usually assume the fluid to be not everywhere in spacetime, i.e. 
the support of p is a proper subset of spacetime. Then the four velocity of the 
fluid is an absolute structure, contrary to our physical intention. 

I know of two suggestions how to avoid this conclusion in the present example. 
One is to use the 1-form u^dx^ rather than the vector field u^9^ as fundamen- 
tal dynamical variable for the fluid. The point being that one-form fields are not 
locally diffeomorphism equivalent. For example, a closed (exact) one-form field 
will always be mapped into a closed (exact) one-form field, and hence cannot be 
locally diffeomorphism equivalent to a non-closed field. Another suggestion, in 
fact the only one that I have seen in the literature ( llTOll p. 59 footnote 9 and l25l . 
p. 99, footnote 8) is to take the energy-momentum density 11 rather than u as fun- 
damental variable. To be sure, on the support of Ff we can think of it as equal to pu, 
but on the complement of its support there is no need to define a u. This avoids the 
unwanted conclusion whenever TT indeed has zeros; otherwise the argument given 
above for u just applies to IT. 

An even simpler argument, which I have not seen in the physics literature, 
even applies to pure gravity. It rests on the following theorem from differential 
geometry, an elegant proof of which was given by Moser (T5\ : given two compact 
oriented n-dimensional manifolds Vi and V2 with n-forms ]x-\ and ]X2 respectively. 
There exists an orientation preserving diffeomorphism cj) : Vi — > V2 such that 
4)* 10-2 = iff the |Xi -volume of Vi equals the [0.2- volume of V2, i.e. iff 



M-1 

V, 



^2 . (27) 



If we take Vi = V2 to be the closure of an open neighbourhood U in the space- 
time manifold M, this theorem implies that the metric volume forms, written in 
coordinates as 

det[g(9^,9^)]| dx^ A--- Adx^, (28) 

are locally diffeomorphism equivalent iff they assign the same volume to U. Hence 
it follows that the metric volume elements modulo constant factors are absolute 
elements in pure gravity. Note that this implies that for any metric g any any point 
p G M there is always a local coordinate system {x^} in an open neighbourhood U 
of p such that det[g(9m 9-v)]| = 1 ■ 



It seems a little strange to be forced to consider velocity fields u in regions where p — 0, i.e. where 
there is no fluid matter. Velocity of what? one might ask. In concrete applications this means that 
we have to extend u beyond the support of p and that the physical prediction is independent of 
that extension. 
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3 A historical note on scalar gravity 



In his contribution ("Physikalischer Teil") to the 'Entwurf Paper' (|24|, Vol.4, 
Doc. 13), that Einstein wrote with his lifelong friend Marcel Grossmann^\ Einstein 
finished with §7 whose title asks: "Can the gravitational field be reduced to a 
scalar ?" ("Kann das Gravitationsfeld auf einen Skalar zuriickgefiihrt werden ?"). 
There he presented a Gedankenexperiment-based argument which apparently shows 
that any Poincare-invariant'^ scalar theory of gravity, in which the scalar gravita- 
tional field couples exclusively to the trace of the energy-momentum tensor, nec- 
essarily violates energy conservation and is hence physically inconsistent. This he 
presented as plausibility argument why gravity has to be described by a more com- 
plex quantity, like the g^^ of the 'Entwurf Paper', where he and Grossmann con- 
siders 'generally covariant' equations for the first time. After having presented his 
argument, he ends § 7 (and his contribution) with the following sentences, showing 
that his conviction actually derived on some form of the PGC: 

"Ich muB freilich zugeben, daB fiir mich das wirksamste Argument 
dariir, daB eine derartige Theorie [eine skalare Gravitationstheorie] zu 
verwerfen sei, auf der Uberzeugung beruht, daB die Relativitat nicht 
nur orthogonalen linearen Substitutionen gegeniiber besteht, sondern 
einer viel weitere Substitutionsgruppe gegeniiber. Aber wir sind schon 
desshalb nicht berechtigt, dieses Argument geltend zu machen, well 
wir nicht imstande waren, die (allgemeinste) Substitutionsgruppe aus- 
findig zu machen, welche zu unseren Gravitationsgleichungen gehort".'^ 
r l24l . Vol. 4, Doc. 13, p. 323) 

Einstein belief, that scalar theories of gravity are ruled out, placed him — in 
this respect — in opposition to most of his contemporary physicist who took part in 
the search for a (special-) relativistic theory of gravity (Nordstrom, Abraham, Mie, 
von Laue ..). Some of them were not convinced, it seems, by Einstein's inconsis- 
tency argument. For example, even after General Relativity was completed. Max 
von Laue wrote a comprehensive review paper on Nordstroms theory, thereby at 
least implicitly claiming inner consistency |26|. 

On the other hand, modem commentators seem to fully accept Einstein's claim 
and view it as important step in the development of General Relativity |[T6lfl7l 

" Marcel Grossmann wrote the "Mathematischer Teil". 

By 'Poincare group' we shall understand the inhomogeneous SL(2, C), i.e. the semi-direct product 
R"* X SL (2, C), defined by the multiplication law (q, A) (b,B) = (a + 7t{A)b, AB), where tt : 
SL(2,C) — > S0(l,3)o (the identity component of S0(1 , 3)) is the 2-1 covering homomorphism. 
The phrase 'Poincare-invariance' is always taken to mean that the equations of motion admit the 
Poincare group as symmetry group, i.e. it transforms solutions to solutions of the very same 
equation. 

" To be sure, I have to admit that in my opinion the most effective argument for why such a theory 
[a scalar theory of gravity] has to be abandoned rests on the conviction that relativity holds with 
respect to a much wider group of substitutions than just the linear-orthogonal ones. However, we 
are not justified to push this argument since we were not able to determine the (most general) 
group of substitutions which belongs to our gravitational equations. 
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and possibly also as an important step towards the requirement of general covari- 
ance. From a modem field theoretic viewpoint, however, the claim of violation 
of energy conservation of a Poincare -invariant theory sounds even paradoxical, 
since Noether's theorem guarantees the existence of a conserved quantity associ- 
ated to the symmetry of time-translations. This quantity is usually identified with 
energy (or even taken as definition of energy). Hence Einstein's argument can- 
not be entirely obvious. It even becomes intrinsically incorrect if placed within a 
straightforward scalar theory of gravity, as will be shown below. 



3.1 Einstein's argument 



shaft 



Einstein first pointed out that the source for the gravitational field must be a scalar 
built from the matter quantities alone, and that the only such scalar is the trace of 
the energy-momentum tensor (as pointed out to Einstein by von Laue, as Einstein 
acknowledges, calling Jj^ii the "Laue Scalar"). Moreover, for closed stationary sys- 
tems the so-called Laue-Theorem states that the integral over space of T^^"^ must 
vanish, except for |a. = = "v; hence the space integral of T/f equals that of T°°, 
which means that the total (active and passive) gravitational mass of a closed static 
system equals its inertial mass. However, if the system is not closed, the weight de- 
pends on the stresses (the spatial components T^'), which Einstein deems unaccept- 
able. 

His argument proper is then as follows: consider a box 
B filled with electromagnetic radiation of total energy 
E. We idealize the walls of the box to be inwardly per- 
fectly mirrored and of infinite stiffness, i.e. they can 
support normal stresses (pressure) without any defor- 
mation. The box has an additional vertical strut in the 
middle connecting top and bottom walls, which sup- 
ports all the vertical material stresses that counterbal- 
ance the radiation pressure, so that the side walls merely 
sustain normal and no tangential stresses. The box can 
slide without friction along a vertical shaft, S, whose 
cross section corresponds exactly to that of the box. The walls of the shaft are 
likewise idealized to be inwardly perfectly mirrored and of infinite stiffness. The 
whole system of shaft and box is finally placed in a homogeneous static gravita- 
tional field, g, which points vertically downward. Now we perform the following 
process. We start with the box being placed in the shaft in the upper position. Then 
we slide it down to the lower position; see Fig. 1. There we remove the side walls 
of the box — without any radiation leaking out — such that the sideways pressures 
are now provided by the shaft walls. The strut in the middle is left in position 
to further take all the vertical stresses, as before. Then the box together with the 
detached side walls are pulled up to their original positions. Finally the system is 
reassembled so that it assumes its initial state. Einstein's claim is now that in a 
very general class of imaginable scalar theories the process of pulling up the parts 
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Figure 1: Lowering 

the box in the gravi- 
tational field with side 
walls attached. 



Figure 2: Raising the 

box in the gravitational 

field with side walls 
taken off. 



needs less work than what is gained in energy in letting the box (with side walls 
attached ) down. Hence he concluded that such theories necessarily violate energy 
conservation. 

Indeed, radiation plus box is a closed static system. Hence the weight of the 
total system is proportional to its total energy E, which we may pretend to be given 
by the radiation energy alone, since the contributions from the rest masses of the 
walls will cancel in the final energy balance, so that we may formally set them 
to zero at this point. Lowering this box by an amount H in a static homogeneous 
gravitational field of strength g results in an energy gain of AE = hgE/c^. So 
despite the fact that radiation has a traceless energy-momentum tensor, trapped ra- 
diation has a weight given by E/c^. This is due to the radiation pressure which 
puts the walls of the trapping box under tension. Tension makes an independent 
contribution to weight, independent of the material that supports it. For each par- 
allel pair of side-walls the tension is just the radiation pressure, which is one third 
of the energy density. So each pair of side-walls contribute E/3c^ to the (passive) 
gravitational mass (over and above their rest mass, which we set to zero) in the 
lowering process when stressed, and zero in the raising process when unstressed. 
Hence, Einstein concluded, there is a net gain in energy of 2E/3c^ (there are two 
pairs of side walls). 

But it seems that Einstein neglects the fact that, in contrast to the lowering pro- 
cess, during the lifting process the state of the shaft S is changed. Moreover, the 
associated contribution to the energy balance just renders Einstein's argument in- 
conclusive. Indeed, when the side walls are first removed in the lower position, the 
walls of the shaft necessarily come under stress because they now need to provide 
the horizontal balancing pressures. In the raising process that stress distribution 
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of the shaft is translated upwards. But that does cost energy, even though it is not 
associated with any proper transport of the material the shaft is made from. As 
already pointed out, stresses make their own contribution to weight, independent 
of the nature of the material that supports them. In particular, a redistribution of 
stresses in a material immersed in a gravitational field will generally makes a non- 
vanishing contribution to the energy balance, even if the material does not move. 
This is explicitly seen in the model theory discussed next. 



3.2 A formally consistent model-theory for scalar gravity 

We wish to construct a Poincare-invariant theory of a scalar gravitational field, O, 
coupled to matter. We will use Lagrangian methods. Regarding the Minkowski 
metric we use the 'mostly minus' convention, that is, r|^^ = diag(l , — 1 , — 1 , — 1 ). 

We start from the obvious generalization of Poisson's equation, AO = AuGq, 
with the 'Laue-scalar' as source: 

□O = -kT , where k := AnG /c^ . (29) 

Here □ = r| ^^9^^9-1, and T = ^^^l^-^. T^^ is the stress-energy tensor of the matter 
(sign-normalization: Too = Tq = +energy-density). We seek an action which 
makes ( l29l its Euler-Lagrange equation. It's easy to guess^*^: 



'field + Sint — 7773 



1 



d\ (^9^09*^0 - kOT) . (30) 



where Sfieid, given by the first term, is the action for the gravitational field and Sint, 
given by the second term, accounts for the interaction with matter. 

To this we have to add the action Smauer for the matter, which we only specify 
insofar as we we assume that the matter consists of a point particle of rest-mass 
rao and a 'rest' that needs not be specified further for our purposes here. Hence 
Smatter = Spaiticie + Srom (rom = rcst of matter) where 



_ 2 
'particle — — TTLoC 



dT. (31) 



The quantity dT = ^Ti^^dz^^dz'^ is the proper time along the worldline of the 
particle. The energy-momentum tensor of the particle is given by 



T^^^(x) = moc 



z^^(t)z^(t) 6'4)(x-z(t)) dT, (32) 



so that the particle's contribution to the interaction term in (I30t is 



'int-particle — TTlQ 



(D(z(T))dT. (33) 



Note that O has the physical dimension of a squared velocity, k that of length-over-mass. The 
prefactor 1 / kc^ gives the right hand side of <30t the physical dimension of an action. The overall 
signs are chosen according to the general scheme for Lagrangians: kinetic minus potential energy. 
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Hence the total action can be written in tlie following form: 



moc^ 



1 

+ 



d\{^d^,Od^O - K(l)T,om) ^^^^ 



By construction the field equations that follow from this action are given by 
( l29l . where the energy momentum-tensor refers to the matter without the test par- 
ticle, if we treat the latter as test particle. The equations of motion for the test 
particle are then given by 

= P^^^a^d) , (35a) 
where P^'^ = r^^'^ - zH'^ /c^ (35b) 
and 4) = c^ln(l + O/c^) . (35c) 

Two things are worth remarking at this point: 

• The term P^"^ is a projector perpendicular to the timelike direction given 
by z. It is necessary in order to avoid overdetermination. Due to z*^Z|j^ = 
there can only be three independent equations of motion. Indeed, an equation 
like = d^(^ immediately leads to the integrability condition z^9^4) = 0, 
which renders this equations useless since it says that ^ may not change 
along the worldline of the particle. 

• Whereas O plays the analog of the Newtonian potential in the SR-adapted 
field equation d29l . it is ^ rather than O that plays the analog of the New- 
tonian equation potential in the equation of motion for a test particle. The 
relation between the two potential is given by (I35cl) . We were not free to 
just impose an equation of motion for the test particle, in which ^ in (I35at is 
replaced by O. Rather, (I35at is an unambiguous consequence of the consis- 
tency requirement, according to which all forms of matter couple to gravity 
in the same fashion, namely via the OT - term in the interaction Lagrangian. 
From d32b via d33b this directly leads to d35t . 

Suppose there exists some inertial coordinate system x*^ with respect to which 
O (and hence ^) is static, i.e. 9oO = 0, then in these coordinates (I35al) is equiva- 
lent to the following 3-vector equation (t = x°) 

fez(t) = - fl - \^z(t)\^/cA V(p(z(t)) . (36) 



From Einstein's own recollections we know that he also arrived at an equation 
hke (I36t in an early attempt to generalize Newton's scalar theory of gravity, but 
that he dismissed it for not satisfying some variant of the universality of free fall, 
according to which the vertical acceleration of a body should be independent of the 
horizontal velocity of its center of mass. In his own words: 
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"Dieser Satz, der auch als Satz iiber die Gleichheit der tragen und 
schweren Masse formuliert werden kann, leuchtete mir nun in seiner 
tiefen Bedeutung ein. Ich wunderte mich im hochsten Grade iiber 
sein Bestehen und vermutete, dass in ihm der Schliissel fiir ein tief- 
eres Verstandnis der Tragheit und Gravitation liegen miisse. An seiner 
strengen Giiltigkeit habe ich auch ohne Kenntnis des Resultates der 
schonen Versuche von Eotvos, die mir - wenn ich mich richtig erin- 
nere - erst spater bekannt wurden, nicht ernsthaft gezweifelt."'^ (LSTl. 
pp. 135-136) 

Concerning this statement, at least three things seem truly remarkable: 

• That Einstein would dismiss the quadratic dependence of the vertical accel- 
eration on v/c, as predicted by d36l . as "not in accord with the 'old experi- 
ence' (sic!) of the universality of free fall". 

• The dependence of the vertical acceleration on the horizontal center-of-mass 
velocity is clearly expressed by d36b . However, Einstein's additional claim 
that there is also a similar dependence on the internal energy does not survive 
closer scrutiny. One might think at first that (I36t also predicts that, for ex- 
ample, the gravitational acceleration of a box filled with a gas decreases with 
temperature, due to the increasing velocities of the gas molecules. But this 
arguments neglects the walls of the box which gain in stress due to the ris- 
ing gas pressure. According to (l29l more stress means less weight. In fact, 
a general argument due to Laue (1911) shows that these effects precisely 
cancel (see e.g. II 711 for a lucid discussion). 

• Einstein's requirement that the vertical acceleration should be independent 
of the horizontal velocity is (for good reasons) not at all implied by the 
modern formulation of the (weak) equivalence principle, according to which 
the worldline of a freely falling test-body (without higher mass-multipole- 
moments and without charge and spin) is determined by its initial spacetime 
point and four velocity, i.e. independent of the further constitution of the test 
body. In contrast, Einstein's requirement relates two motions with different 
initial velocities. In fact, it is badly in need of a proper interpretation to 
even make physical sense. Are we to require that two bodies dropped from 
some altitude, one with the other without horizontal initial velocity, reach the 
ground simultaneously? What what does 'simultaneously' refer to? Simul- 
taneously in the initial rest frame of one of the two bodies? Or at the same 
lapse of eigentimes of the two bodies? 

"These investigations, however, led to a result which raised my strong suspicion. According to 
classical mechanics, the vertical acceleration of a body in the vertical gravitational field is inde- 
pendent of the horizontal component of its velocity. Hence in such a gravitational field the vertical 
acceleration of a mechanical system or of its center of gravity comes out independently of its 
internal kinetic energy. But in the theory I advanced, the acceleration of a falling body was not 
independent of its horizontal velocity or the internal energy of the system. This did not fit with the 
old experimental fact that all bodies have the same acceleration in a gravitational field." 
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In passing we remark that d36l gives rise to a periastron precession of — 1 /6 times 
tlie value obtained from GR. 



3.3 Energy conservation 

Corresponding to Poincare-invariance there are 10 conserved currents. In partic- 
ular, the total energy E relative to an inertial system is conserved. For a particle 
coupled to gravity it is easily calculated and consists of three contributions corre- 
sponding to the gravitational field, the particle, and the interaction-energy of parti- 
cle and field: 



2kc2 

2 



d\{(dctOf+[V(Df), (37a) 
Hparticie = raoc^y(v) , (37b) 

Hinieracion = TaQ T ( v) O (z( t ) , t) , (37c) 

where v = |dz(t)/dt| (the velocity of the particle w.r.t. the inertial system) and 
y(v) = 1 /a/T — v^/c^. This looks all very familiar. 

3.4 Energy-momentum conservation in general 

Let's return to general matter models and let T^'^ be the total stress-energy tensor 
of the gravity-matter-system. It is the sum of three contributions: 



where^^ 



' total ' gravity i ' matter ' interaction ) V-^ ^/ 



Tr..y = -^(9^03-0 - ^Ti^^-dAOa^O) , (39a) 

KC 

XilTer — depending on matter model , (39b) 

T^acon =il^"(0/c^)T„,,,„. (39c) 



Energy-momentum-conservation is expressed by 

where ^Ztami the four-force of a possible external agent. The 0-component of it 
(i.e. energy conservation) can be rewritten in the form 



external power supplied = — 

at 



'(otal ' 

D 



T°^n^da. (41) 



If the matter system is of finite spatial extent, meaning that outside some bounded 
spatial region D we have that Tjfi^er vanishes identically, and if we further assume 

" We simply use the standard expression for tiie canonical energy-momentum tensor, which is good 
enough in the present case. IfS = jLdtd^x, it is given by T^^ := (91/90), ^,)0,-v -SiJL. 
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that no gravitational radiation escapes to infinity, the surface integral in S4T\> van- 
ishes identically. Integrating dTTT i over time we then get 



external energy supplied 



+ AE. 



nteraction ) 



(42) 



with 



-interaction 



d\ (0/c2)T,„ 



(43) 



D 



and where A (something) denotes the difference between the initial and final value 
of 'something' . If we apply this to a process that leaves the internal energies of 
the gravitational field and the matter system unchanged, for example a processes 
where the matter system, or at least the relevant parts of it, are rigidly moved in the 
gravitational field, like in Einstein's Gedankenexperiment of the 'radiation-shaft- 
system' , we get 



external energy supplied = A 



d\ (0/c2)T„, 



D 



(44) 



Now my understanding of what a valid claim of energy non-conservation would be, 
is to show that this equation can be violated, granted the hypotheses under which 
it was derived. This is not what Einstein did (compare Conclusions). 

If the matter system stretches out to infinity and conducts energy and momen- 
tum to infinity, than the surface term that was neglected above gives a non-zero 
contribution that must be included in (l44l) . Then a proof of violation of energy 
conservation must disprove this modified equation. (Energy conduction to infinity 
as such is not in any disagreement with energy conservation; you have to prove that 
they do not balance in the form predicted by the theory.) 



3.5 Conclusion 

For the discussion of Einstein's Gedankenexperiment the term d43l is the relevant 
one. It accounts for the weight of stress. Pulling up a radiation-filled box inside 
a shaft also moves up the stresses in the shaft walls that must act sideways to bal- 
ance the radiation pressure. This lifting of stresses to higher gravitational potential 
costs energy, according to the theory presented here. This energy was neglected by 
Einstein, apparently because it is not associated with a transport of matter He in- 
cluded it in the lowering phase, where the side-walls of the box are attached to the 
box and move with it, but neglected them in the raising phase, where the side walls 
are those of the shaft, which do not move. But as far as the 'weight of stresses' is 
concerned, this difference is irrelevant. What (l43l tells us is that raising stresses 
in an ambient gravitational potential costs energy, irrespectively of whether it is 
associated with an actual transport of the stressed matter or not. This would be just 
the same for the transport of heat in a heat conducting material. Raising the heat 
distribution against the gravitational field costs energy, even if the material itself 
does not move. 
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I conclude that Einstein's argument is not convincing. Clearly this is not meant 
to give any scientific support to scalar theories of gravity (as opposed to GR), which 
we know are ruled out by experiment. For example, as already mentioned above, 
the model theory discussed here gives the wrong amount (even the wrong sign) for 
the perihelion shift of Mercury, namely —1/6 times Einstein's value. Moreover, 
theories in which the gravitational field couples to matter via its trace of the energy- 
momentum tensor predict a vanishing global deflection of light. But what is not the 
case is that scalar theories are intrinsically inconsistent, as apparently suggested by 
Einstein. For Einstein this argument might have appeared as a convenient physical 
way to rule out scalar theories, whose primary deficiency he saw, however, in the 
lack of being generally covariant. 
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